Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 2075427 |
| Missing cells | 17761579 |
| Missing cells (%) | 29.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 459.2 MiB |
| Average record size in memory | 232.0 B |
Variable types
| DateTime | 2 |
|---|---|
| Categorical | 6 |
| Text | 13 |
| Numeric | 8 |
NUMBER OF PEDESTRIANS KILLED is highly imbalanced (99.6%) | Imbalance |
NUMBER OF CYCLIST INJURED is highly imbalanced (92.3%) | Imbalance |
NUMBER OF CYCLIST KILLED is highly imbalanced (99.9%) | Imbalance |
CONTRIBUTING FACTOR VEHICLE 4 is highly imbalanced (90.8%) | Imbalance |
CONTRIBUTING FACTOR VEHICLE 5 is highly imbalanced (89.9%) | Imbalance |
BOROUGH has 645746 (31.1%) missing values | Missing |
ZIP CODE has 645996 (31.1%) missing values | Missing |
LATITUDE has 233626 (11.3%) missing values | Missing |
LONGITUDE has 233626 (11.3%) missing values | Missing |
LOCATION has 233626 (11.3%) missing values | Missing |
ON STREET NAME has 440569 (21.2%) missing values | Missing |
CROSS STREET NAME has 784436 (37.8%) missing values | Missing |
OFF STREET NAME has 1727231 (83.2%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 2 has 321736 (15.5%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 3 has 1927163 (92.9%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 4 has 2041953 (98.4%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 5 has 2066358 (99.6%) missing values | Missing |
VEHICLE TYPE CODE 2 has 396691 (19.1%) missing values | Missing |
VEHICLE TYPE CODE 3 has 1932530 (93.1%) missing values | Missing |
VEHICLE TYPE CODE 4 has 2043115 (98.4%) missing values | Missing |
VEHICLE TYPE CODE 5 has 2066635 (99.6%) missing values | Missing |
LATITUDE is highly skewed (γ1 = -20.43042564) | Skewed |
NUMBER OF PERSONS KILLED is highly skewed (γ1 = 33.71743399) | Skewed |
NUMBER OF MOTORIST KILLED is highly skewed (γ1 = 54.74414747) | Skewed |
COLLISION_ID has unique values | Unique |
NUMBER OF PERSONS INJURED has 1601221 (77.2%) zeros | Zeros |
NUMBER OF PERSONS KILLED has 2072415 (99.9%) zeros | Zeros |
NUMBER OF PEDESTRIANS INJURED has 1962919 (94.6%) zeros | Zeros |
NUMBER OF MOTORIST INJURED has 1772939 (85.4%) zeros | Zeros |
NUMBER OF MOTORIST KILLED has 2074246 (99.9%) zeros | Zeros |
Reproduction
| Analysis started | 2024-03-26 20:04:45.483578 |
|---|---|
| Analysis finished | 2024-03-26 20:10:07.981193 |
| Duration | 5 minutes and 22.5 seconds |
| Software version | ydata-profiling vv4.7.0 |
| Download configuration | config.json |
CRASH DATE
Date
| Distinct | 4283 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 MiB |
| Minimum | 2012-07-01 00:00:00 |
|---|---|
| Maximum | 2024-03-22 00:00:00 |
CRASH TIME
Date
| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 MiB |
| Minimum | 2024-03-26 00:00:00 |
|---|---|
| Maximum | 2024-03-26 23:59:00 |
BOROUGH
Categorical
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 645746 |
| Missing (%) | 31.1% |
| Memory size | 15.8 MiB |
| BROOKLYN | |
|---|---|
| QUEENS | |
| MANHATTAN | |
| BRONX | |
| STATEN ISLAND |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 7.4541209 |
| Min length | 5 |
Characters and Unicode
| Total characters | 10657015 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BROOKLYN |
|---|---|
| 2nd row | BROOKLYN |
| 3rd row | BRONX |
| 4th row | BROOKLYN |
| 5th row | MANHATTAN |
Common Values
| Value | Count | Frequency (%) |
| BROOKLYN | 454727 | |
| QUEENS | 383365 | |
| MANHATTAN | 320242 | |
| BRONX | 211335 | 10.2% |
| STATEN ISLAND | 60012 | 2.9% |
| (Missing) | 645746 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| brooklyn | 454727 | |
| queens | 383365 | |
| manhattan | 320242 | |
| bronx | 211335 | |
| staten | 60012 | 4.0% |
| island | 60012 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1809935 | |
| O | 1120789 | |
| A | 1080750 | |
| E | 826742 | 7.8% |
| T | 760508 | 7.1% |
| R | 666062 | 6.2% |
| B | 666062 | 6.2% |
| L | 514739 | 4.8% |
| S | 503389 | 4.7% |
| Y | 454727 | 4.3% |
| Other values (9) | 2253312 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10597003 | |
| Space Separator | 60012 | 0.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1809935 | |
| O | 1120789 | |
| A | 1080750 | |
| E | 826742 | 7.8% |
| T | 760508 | 7.2% |
| R | 666062 | 6.3% |
| B | 666062 | 6.3% |
| L | 514739 | 4.9% |
| S | 503389 | 4.8% |
| Y | 454727 | 4.3% |
| Other values (8) | 2193300 |
Space Separator
| Value | Count | Frequency (%) |
| 60012 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10597003 | |
| Common | 60012 | 0.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1809935 | |
| O | 1120789 | |
| A | 1080750 | |
| E | 826742 | 7.8% |
| T | 760508 | 7.2% |
| R | 666062 | 6.3% |
| B | 666062 | 6.3% |
| L | 514739 | 4.9% |
| S | 503389 | 4.8% |
| Y | 454727 | 4.3% |
| Other values (8) | 2193300 |
Common
| Value | Count | Frequency (%) |
| 60012 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10657015 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1809935 | |
| O | 1120789 | |
| A | 1080750 | |
| E | 826742 | 7.8% |
| T | 760508 | 7.1% |
| R | 666062 | 6.2% |
| B | 666062 | 6.2% |
| L | 514739 | 4.8% |
| S | 503389 | 4.7% |
| Y | 454727 | 4.3% |
| Other values (9) | 2253312 |
ZIP CODE
Text
MISSING 
| Distinct | 235 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 645996 |
| Missing (%) | 31.1% |
| Memory size | 15.8 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 7147155 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 11208 |
|---|---|
| 2nd row | 11233 |
| 3rd row | 10475 |
| 4th row | 11207 |
| 5th row | 10017 |
| Value | Count | Frequency (%) |
| 11207 | 27789 | 1.9% |
| 11236 | 19259 | 1.3% |
| 11101 | 19220 | 1.3% |
| 11203 | 18372 | 1.3% |
| 11234 | 18011 | 1.3% |
| 11385 | 17924 | 1.3% |
| 11208 | 17312 | 1.2% |
| 10019 | 17258 | 1.2% |
| 11212 | 17236 | 1.2% |
| 11201 | 17146 | 1.2% |
| Other values (224) | 1239862 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2772280 | |
| 0 | 1267960 | |
| 2 | 835731 | 11.7% |
| 3 | 624685 | 8.7% |
| 4 | 511341 | 7.2% |
| 6 | 319786 | 4.5% |
| 5 | 281189 | 3.9% |
| 7 | 243220 | 3.4% |
| 8 | 151585 | 2.1% |
| 9 | 139168 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7146945 | |
| Space Separator | 210 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2772280 | |
| 0 | 1267960 | |
| 2 | 835731 | 11.7% |
| 3 | 624685 | 8.7% |
| 4 | 511341 | 7.2% |
| 6 | 319786 | 4.5% |
| 5 | 281189 | 3.9% |
| 7 | 243220 | 3.4% |
| 8 | 151585 | 2.1% |
| 9 | 139168 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 210 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7147155 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2772280 | |
| 0 | 1267960 | |
| 2 | 835731 | 11.7% |
| 3 | 624685 | 8.7% |
| 4 | 511341 | 7.2% |
| 6 | 319786 | 4.5% |
| 5 | 281189 | 3.9% |
| 7 | 243220 | 3.4% |
| 8 | 151585 | 2.1% |
| 9 | 139168 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7147155 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2772280 | |
| 0 | 1267960 | |
| 2 | 835731 | 11.7% |
| 3 | 624685 | 8.7% |
| 4 | 511341 | 7.2% |
| 6 | 319786 | 4.5% |
| 5 | 281189 | 3.9% |
| 7 | 243220 | 3.4% |
| 8 | 151585 | 2.1% |
| 9 | 139168 | 1.9% |
LATITUDE
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 126594 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 233626 |
| Missing (%) | 11.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.627693 |
| Minimum | 0 |
|---|---|
| Maximum | 43.344444 |
| Zeros | 4360 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 40.596622 |
| Q1 | 40.6678 |
| median | 40.72083 |
| Q3 | 40.769592 |
| 95-th percentile | 40.86205 |
| Maximum | 43.344444 |
| Range | 43.344444 |
| Interquartile range (IQR) | 0.101792 |
Descriptive statistics
| Standard deviation | 1.9806568 |
|---|---|
| Coefficient of variation (CV) | 0.048751397 |
| Kurtosis | 416.08064 |
| Mean | 40.627693 |
| Median Absolute Deviation (MAD) | 0.051354 |
| Skewness | -20.430426 |
| Sum | 74828126 |
| Variance | 3.9230014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4360 | 0.2% |
| 40.861862 | 883 | < 0.1% |
| 40.696033 | 762 | < 0.1% |
| 40.8047 | 692 | < 0.1% |
| 40.608757 | 671 | < 0.1% |
| 40.798256 | 627 | < 0.1% |
| 40.759308 | 622 | < 0.1% |
| 40.6960346 | 587 | < 0.1% |
| 40.675735 | 557 | < 0.1% |
| 40.658577 | 520 | < 0.1% |
| Other values (126584) | 1831520 | |
| (Missing) | 233626 | 11.3% |
| Value | Count | Frequency (%) |
| 0 | 4360 | |
| 30.78418 | 1 | < 0.1% |
| 34.783634 | 1 | < 0.1% |
| 40.4989488 | 2 | < 0.1% |
| 40.4991346 | 1 | < 0.1% |
| 40.49931 | 1 | < 0.1% |
| 40.4994787 | 1 | < 0.1% |
| 40.499659 | 1 | < 0.1% |
| 40.49971 | 1 | < 0.1% |
| 40.49984 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 43.344444 | 1 | < 0.1% |
| 42.64154 | 1 | < 0.1% |
| 42.318317 | 1 | < 0.1% |
| 42.107204 | 1 | < 0.1% |
| 41.91661 | 1 | < 0.1% |
| 41.34796 | 1 | < 0.1% |
| 41.258785 | 1 | < 0.1% |
| 41.12615 | 5 | |
| 41.12421 | 1 | < 0.1% |
| 41.061634 | 2 | < 0.1% |
LONGITUDE
Real number (ℝ)
MISSING 
| Distinct | 98351 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 233626 |
| Missing (%) | 11.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.752129 |
| Minimum | -201.35999 |
|---|---|
| Maximum | 0 |
| Zeros | 4360 |
| Zeros (%) | 0.2% |
| Negative | 1837441 |
| Negative (%) | 88.5% |
| Memory size | 15.8 MiB |
Quantile statistics
| Minimum | -201.35999 |
|---|---|
| 5-th percentile | -74.03607 |
| Q1 | -73.97484 |
| median | -73.92726 |
| Q3 | -73.866731 |
| 95-th percentile | -73.763239 |
| Maximum | 0 |
| Range | 201.35999 |
| Interquartile range (IQR) | 0.1081089 |
Descriptive statistics
| Standard deviation | 3.7233454 |
|---|---|
| Coefficient of variation (CV) | -0.050484581 |
| Kurtosis | 440.66 |
| Mean | -73.752129 |
| Median Absolute Deviation (MAD) | 0.0526217 |
| Skewness | 16.099628 |
| Sum | -1.3583675 × 108 |
| Variance | 13.863301 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4360 | 0.2% |
| -73.89063 | 763 | < 0.1% |
| -73.91282 | 719 | < 0.1% |
| -73.98453 | 699 | < 0.1% |
| -74.038086 | 672 | < 0.1% |
| -73.89686 | 657 | < 0.1% |
| -73.91243 | 654 | < 0.1% |
| -73.9845292 | 587 | < 0.1% |
| -73.94476 | 583 | < 0.1% |
| -73.9112 | 576 | < 0.1% |
| Other values (98341) | 1831531 | |
| (Missing) | 233626 | 11.3% |
| Value | Count | Frequency (%) |
| -201.35999 | 1 | < 0.1% |
| -201.23706 | 105 | |
| -89.13527 | 1 | < 0.1% |
| -86.76847 | 1 | < 0.1% |
| -79.61955 | 1 | < 0.1% |
| -79.00183 | 1 | < 0.1% |
| -76.2634 | 1 | < 0.1% |
| -76.02163 | 1 | < 0.1% |
| -74.742 | 7 | < 0.1% |
| -74.25496 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4360 | |
| -32.768513 | 16 | < 0.1% |
| -47.209625 | 3 | < 0.1% |
| -73.66301 | 1 | < 0.1% |
| -73.70055 | 2 | < 0.1% |
| -73.700584 | 11 | < 0.1% |
| -73.7005968 | 10 | < 0.1% |
| -73.70061 | 4 | < 0.1% |
| -73.70071 | 4 | < 0.1% |
| -73.70073 | 1 | < 0.1% |
LOCATION
Text
MISSING 
| Distinct | 283006 |
|---|---|
| Distinct (%) | 15.4% |
| Missing | 233626 |
| Missing (%) | 11.3% |
| Memory size | 15.8 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 24 |
| Mean length | 22.779989 |
| Min length | 10 |
Characters and Unicode
| Total characters | 41956206 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 6 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 155498 ? |
|---|---|
| Unique (%) | 8.4% |
Sample
| 1st row | (40.667202, -73.8665) |
|---|---|
| 2nd row | (40.683304, -73.917274) |
| 3rd row | (40.709183, -73.956825) |
| 4th row | (40.86816, -73.83148) |
| 5th row | (40.67172, -73.8971) |
| Value | Count | Frequency (%) |
| 0.0 | 8720 | 0.2% |
| 40.861862 | 883 | < 0.1% |
| 73.89063 | 763 | < 0.1% |
| 40.696033 | 762 | < 0.1% |
| 73.91282 | 719 | < 0.1% |
| 73.98453 | 699 | < 0.1% |
| 40.8047 | 692 | < 0.1% |
| 74.038086 | 672 | < 0.1% |
| 40.608757 | 671 | < 0.1% |
| 73.89686 | 657 | < 0.1% |
| Other values (224934) | 3668364 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 4595577 | |
| 4 | 3980471 | 9.5% |
| . | 3683602 | 8.8% |
| 3 | 3498540 | 8.3% |
| 0 | 3400841 | 8.1% |
| 9 | 2700094 | 6.4% |
| 8 | 2648683 | 6.3% |
| 6 | 2616640 | 6.2% |
| 5 | 2094509 | 5.0% |
| ( | 1841801 | 4.4% |
| Other values (6) | 10895448 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 29067959 | |
| Other Punctuation | 5525403 | 13.2% |
| Open Punctuation | 1841801 | 4.4% |
| Space Separator | 1841801 | 4.4% |
| Close Punctuation | 1841801 | 4.4% |
| Dash Punctuation | 1837441 | 4.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 4595577 | |
| 4 | 3980471 | |
| 3 | 3498540 | |
| 0 | 3400841 | |
| 9 | 2700094 | |
| 8 | 2648683 | |
| 6 | 2616640 | |
| 5 | 2094509 | |
| 2 | 1784398 | 6.1% |
| 1 | 1748206 | 6.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3683602 | |
| , | 1841801 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1841801 |
Space Separator
| Value | Count | Frequency (%) |
| 1841801 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1841801 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1837441 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 41956206 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 4595577 | |
| 4 | 3980471 | 9.5% |
| . | 3683602 | 8.8% |
| 3 | 3498540 | 8.3% |
| 0 | 3400841 | 8.1% |
| 9 | 2700094 | 6.4% |
| 8 | 2648683 | 6.3% |
| 6 | 2616640 | 6.2% |
| 5 | 2094509 | 5.0% |
| ( | 1841801 | 4.4% |
| Other values (6) | 10895448 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41956206 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 4595577 | |
| 4 | 3980471 | 9.5% |
| . | 3683602 | 8.8% |
| 3 | 3498540 | 8.3% |
| 0 | 3400841 | 8.1% |
| 9 | 2700094 | 6.4% |
| 8 | 2648683 | 6.3% |
| 6 | 2616640 | 6.2% |
| 5 | 2094509 | 5.0% |
| ( | 1841801 | 4.4% |
| Other values (6) | 10895448 |
ON STREET NAME
Text
MISSING 
| Distinct | 18410 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 440569 |
| Missing (%) | 21.2% |
| Memory size | 15.8 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 29.630325 |
| Min length | 2 |
Characters and Unicode
| Total characters | 48441374 |
|---|---|
| Distinct characters | 75 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6537 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | WHITESTONE EXPRESSWAY |
|---|---|
| 2nd row | QUEENSBORO BRIDGE UPPER |
| 3rd row | THROGS NECK BRIDGE |
| 4th row | SARATOGA AVENUE |
| 5th row | MAJOR DEEGAN EXPRESSWAY RAMP |
| Value | Count | Frequency (%) |
| avenue | 608264 | 16.1% |
| street | 520901 | 13.8% |
| east | 153481 | 4.1% |
| boulevard | 127014 | 3.4% |
| west | 114792 | 3.0% |
| parkway | 74643 | 2.0% |
| road | 68123 | 1.8% |
| expressway | 63293 | 1.7% |
| island | 30410 | 0.8% |
| queens | 27154 | 0.7% |
| Other values (5393) | 1983965 |
Most occurring characters
| Value | Count | Frequency (%) |
| 27562630 | ||
| E | 3672854 | 7.6% |
| A | 1951050 | 4.0% |
| T | 1831929 | 3.8% |
| R | 1669600 | 3.4% |
| N | 1427915 | 2.9% |
| S | 1407885 | 2.9% |
| U | 977757 | 2.0% |
| O | 868930 | 1.8% |
| V | 852133 | 1.8% |
| Other values (65) | 6218691 | 12.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 27562630 | |
| Uppercase Letter | 19575164 | |
| Decimal Number | 1174050 | 2.4% |
| Lowercase Letter | 118214 | 0.2% |
| Other Punctuation | 4644 | < 0.1% |
| Open Punctuation | 3250 | < 0.1% |
| Close Punctuation | 3245 | < 0.1% |
| Dash Punctuation | 175 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
| Control | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 3672854 | |
| A | 1951050 | |
| T | 1831929 | |
| R | 1669600 | 8.5% |
| N | 1427915 | 7.3% |
| S | 1407885 | 7.2% |
| U | 977757 | 5.0% |
| O | 868930 | 4.4% |
| V | 852133 | 4.4% |
| L | 642960 | 3.3% |
| Other values (16) | 4272151 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 15891 | |
| r | 10464 | 8.9% |
| n | 9918 | 8.4% |
| a | 9880 | 8.4% |
| t | 8654 | 7.3% |
| s | 7260 | 6.1% |
| o | 6963 | 5.9% |
| y | 5733 | 4.8% |
| l | 5459 | 4.6% |
| d | 4582 | 3.9% |
| Other values (16) | 33410 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 267131 | |
| 3 | 132818 | |
| 2 | 131224 | |
| 4 | 111253 | |
| 5 | 108833 | |
| 6 | 95492 | 8.1% |
| 8 | 88322 | 7.5% |
| 7 | 86660 | 7.4% |
| 9 | 77421 | 6.6% |
| 0 | 74896 | 6.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3444 | |
| / | 1062 | 22.9% |
| & | 63 | 1.4% |
| ' | 37 | 0.8% |
| # | 16 | 0.3% |
| , | 16 | 0.3% |
| @ | 6 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 27562630 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3250 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3245 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 175 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 1 |
Control
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 28747996 | |
| Latin | 19693378 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 3672854 | |
| A | 1951050 | |
| T | 1831929 | |
| R | 1669600 | 8.5% |
| N | 1427915 | 7.3% |
| S | 1407885 | 7.1% |
| U | 977757 | 5.0% |
| O | 868930 | 4.4% |
| V | 852133 | 4.3% |
| L | 642960 | 3.3% |
| Other values (42) | 4390365 |
Common
| Value | Count | Frequency (%) |
| 27562630 | ||
| 1 | 267131 | 0.9% |
| 3 | 132818 | 0.5% |
| 2 | 131224 | 0.5% |
| 4 | 111253 | 0.4% |
| 5 | 108833 | 0.4% |
| 6 | 95492 | 0.3% |
| 8 | 88322 | 0.3% |
| 7 | 86660 | 0.3% |
| 9 | 77421 | 0.3% |
| Other values (13) | 86212 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48441374 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 27562630 | ||
| E | 3672854 | 7.6% |
| A | 1951050 | 4.0% |
| T | 1831929 | 3.8% |
| R | 1669600 | 3.4% |
| N | 1427915 | 2.9% |
| S | 1407885 | 2.9% |
| U | 977757 | 2.0% |
| O | 868930 | 1.8% |
| V | 852133 | 1.8% |
| Other values (65) | 6218691 | 12.8% |
MISSING 
| Distinct | 20236 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 784436 |
| Missing (%) | 37.8% |
| Memory size | 15.8 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 22.706216 |
| Min length | 1 |
Characters and Unicode
| Total characters | 29313520 |
|---|---|
| Distinct characters | 76 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 6201 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 20 AVENUE |
|---|---|
| 2nd row | DECATUR STREET |
| 3rd row | EAST 43 STREET |
| 4th row | EAST GATE PLAZA |
| 5th row | west 80 street -west 81 street |
| Value | Count | Frequency (%) |
| avenue | 565307 | 19.8% |
| street | 459527 | 16.1% |
| east | 112172 | 3.9% |
| west | 71155 | 2.5% |
| boulevard | 68647 | 2.4% |
| road | 55544 | 1.9% |
| place | 33946 | 1.2% |
| parkway | 26605 | 0.9% |
| 3 | 18757 | 0.7% |
| park | 17426 | 0.6% |
| Other values (5483) | 1426325 |
Most occurring characters
| Value | Count | Frequency (%) |
| 14115616 | ||
| E | 2937153 | 10.0% |
| T | 1453458 | 5.0% |
| A | 1419427 | 4.8% |
| R | 1147248 | 3.9% |
| N | 1074756 | 3.7% |
| S | 988831 | 3.4% |
| U | 777244 | 2.7% |
| V | 708819 | 2.4% |
| O | 578382 | 2.0% |
| Other values (66) | 4112586 | 14.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 14115616 | |
| Uppercase Letter | 14063132 | |
| Decimal Number | 1070577 | 3.7% |
| Lowercase Letter | 63842 | 0.2% |
| Other Punctuation | 314 | < 0.1% |
| Dash Punctuation | 27 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
| Control | 2 | < 0.1% |
| Math Symbol | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2937153 | |
| T | 1453458 | |
| A | 1419427 | |
| R | 1147248 | 8.2% |
| N | 1074756 | 7.6% |
| S | 988831 | 7.0% |
| U | 777244 | 5.5% |
| V | 708819 | 5.0% |
| O | 578382 | 4.1% |
| L | 437603 | 3.1% |
| Other values (16) | 2540211 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11924 | |
| t | 6629 | |
| a | 6271 | |
| r | 5269 | 8.3% |
| n | 4531 | 7.1% |
| s | 4170 | 6.5% |
| o | 3075 | 4.8% |
| v | 2968 | 4.6% |
| u | 2602 | 4.1% |
| l | 2297 | 3.6% |
| Other values (16) | 14106 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 237281 | |
| 2 | 126066 | |
| 3 | 117595 | |
| 4 | 96616 | |
| 5 | 96306 | |
| 8 | 85041 | 7.9% |
| 7 | 84929 | 7.9% |
| 6 | 84387 | 7.9% |
| 9 | 73436 | 6.9% |
| 0 | 68920 | 6.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 130 | |
| . | 74 | |
| & | 53 | |
| ' | 51 | 16.2% |
| ? | 3 | 1.0% |
| , | 3 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 14115616 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 27 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 |
Control
| Value | Count | Frequency (%) |
| | 2 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2 |
Other Symbol
| Value | Count | Frequency (%) |
| � | 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15186546 | |
| Latin | 14126974 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 2937153 | |
| T | 1453458 | |
| A | 1419427 | |
| R | 1147248 | 8.1% |
| N | 1074756 | 7.6% |
| S | 988831 | 7.0% |
| U | 777244 | 5.5% |
| V | 708819 | 5.0% |
| O | 578382 | 4.1% |
| L | 437603 | 3.1% |
| Other values (42) | 2604053 |
Common
| Value | Count | Frequency (%) |
| 14115616 | ||
| 1 | 237281 | 1.6% |
| 2 | 126066 | 0.8% |
| 3 | 117595 | 0.8% |
| 4 | 96616 | 0.6% |
| 5 | 96306 | 0.6% |
| 8 | 85041 | 0.6% |
| 7 | 84929 | 0.6% |
| 6 | 84387 | 0.6% |
| 9 | 73436 | 0.5% |
| Other values (14) | 69273 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29313519 | |
| Specials | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 14115616 | ||
| E | 2937153 | 10.0% |
| T | 1453458 | 5.0% |
| A | 1419427 | 4.8% |
| R | 1147248 | 3.9% |
| N | 1074756 | 3.7% |
| S | 988831 | 3.4% |
| U | 777244 | 2.7% |
| V | 708819 | 2.4% |
| O | 578382 | 2.0% |
| Other values (65) | 4112585 | 14.0% |
Specials
| Value | Count | Frequency (%) |
| � | 1 |
OFF STREET NAME
Text
MISSING 
| Distinct | 225845 |
|---|---|
| Distinct (%) | 64.9% |
| Missing | 1727231 |
| Missing (%) | 83.2% |
| Memory size | 15.8 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 40 |
| Mean length | 36.021158 |
| Min length | 8 |
Characters and Unicode
| Total characters | 12542423 |
|---|---|
| Distinct characters | 84 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 176197 ? |
|---|---|
| Unique (%) | 50.6% |
Sample
| 1st row | 1211 LORING AVENUE |
|---|---|
| 2nd row | 344 BAYCHESTER AVENUE |
| 3rd row | 2047 PITKIN AVENUE |
| 4th row | 480 DEAN STREET |
| 5th row | 878 FLATBUSH AVENUE |
| Value | Count | Frequency (%) |
| avenue | 137975 | 11.9% |
| street | 125856 | 10.9% |
| east | 33204 | 2.9% |
| west | 23966 | 2.1% |
| boulevard | 22127 | 1.9% |
| road | 16430 | 1.4% |
| lot | 7881 | 0.7% |
| parking | 7267 | 0.6% |
| of | 6949 | 0.6% |
| parkway | 6943 | 0.6% |
| Other values (27589) | 769819 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6866584 | ||
| E | 796771 | 6.4% |
| T | 436287 | 3.5% |
| A | 408734 | 3.3% |
| R | 339643 | 2.7% |
| N | 298626 | 2.4% |
| S | 285926 | 2.3% |
| 1 | 276924 | 2.2% |
| U | 203017 | 1.6% |
| V | 189426 | 1.5% |
| Other values (74) | 2440485 | 19.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Space Separator | 6866584 | |
| Uppercase Letter | 4106288 | |
| Decimal Number | 1448619 | 11.5% |
| Dash Punctuation | 81967 | 0.7% |
| Lowercase Letter | 24748 | 0.2% |
| Other Punctuation | 9582 | 0.1% |
| Open Punctuation | 2311 | < 0.1% |
| Close Punctuation | 2300 | < 0.1% |
| Modifier Symbol | 18 | < 0.1% |
| Connector Punctuation | 3 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 796771 | |
| T | 436287 | |
| A | 408734 | |
| R | 339643 | |
| N | 298626 | 7.3% |
| S | 285926 | 7.0% |
| U | 203017 | 4.9% |
| V | 189426 | 4.6% |
| O | 189076 | 4.6% |
| L | 142447 | 3.5% |
| Other values (16) | 816335 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4129 | |
| t | 2882 | |
| r | 2325 | |
| a | 2177 | 8.8% |
| n | 1624 | 6.6% |
| s | 1611 | 6.5% |
| o | 1310 | 5.3% |
| v | 1058 | 4.3% |
| d | 995 | 4.0% |
| l | 995 | 4.0% |
| Other values (16) | 5642 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 6433 | |
| & | 1740 | 18.2% |
| . | 1001 | 10.4% |
| @ | 145 | 1.5% |
| , | 83 | 0.9% |
| : | 60 | 0.6% |
| # | 54 | 0.6% |
| ' | 50 | 0.5% |
| * | 8 | 0.1% |
| ? | 3 | < 0.1% |
| Other values (2) | 5 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 276924 | |
| 2 | 188118 | |
| 0 | 163217 | |
| 3 | 147759 | |
| 5 | 146349 | |
| 4 | 129563 | |
| 6 | 105739 | 7.3% |
| 7 | 103087 | 7.1% |
| 8 | 97604 | 6.7% |
| 9 | 90259 | 6.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2299 | |
| ] | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 1 | ||
| | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 6866584 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 81967 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2311 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 18 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8411387 | |
| Latin | 4131036 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 796771 | |
| T | 436287 | |
| A | 408734 | |
| R | 339643 | |
| N | 298626 | 7.2% |
| S | 285926 | 6.9% |
| U | 203017 | 4.9% |
| V | 189426 | 4.6% |
| O | 189076 | 4.6% |
| L | 142447 | 3.4% |
| Other values (42) | 841083 |
Common
| Value | Count | Frequency (%) |
| 6866584 | ||
| 1 | 276924 | 3.3% |
| 2 | 188118 | 2.2% |
| 0 | 163217 | 1.9% |
| 3 | 147759 | 1.8% |
| 5 | 146349 | 1.7% |
| 4 | 129563 | 1.5% |
| 6 | 105739 | 1.3% |
| 7 | 103087 | 1.2% |
| 8 | 97604 | 1.2% |
| Other values (22) | 186443 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12542423 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6866584 | ||
| E | 796771 | 6.4% |
| T | 436287 | 3.5% |
| A | 408734 | 3.3% |
| R | 339643 | 2.7% |
| N | 298626 | 2.4% |
| S | 285926 | 2.3% |
| 1 | 276924 | 2.2% |
| U | 203017 | 1.6% |
| V | 189426 | 1.5% |
| Other values (74) | 2440485 | 19.5% |
NUMBER OF PERSONS INJURED
Real number (ℝ)
ZEROS 
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.30980159 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 1601221 |
| Zeros (%) | 77.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.69996885 |
|---|---|
| Coefficient of variation (CV) | 2.2594102 |
| Kurtosis | 51.296075 |
| Mean | 0.30980159 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.2602307 |
| Sum | 642965 |
| Variance | 0.4899564 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1601221 | |
| 1 | 368039 | 17.7% |
| 2 | 69310 | 3.3% |
| 3 | 22649 | 1.1% |
| 4 | 8403 | 0.4% |
| 5 | 3225 | 0.2% |
| 6 | 1350 | 0.1% |
| 7 | 574 | < 0.1% |
| 8 | 252 | < 0.1% |
| 9 | 129 | < 0.1% |
| Other values (22) | 257 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1601221 | |
| 1 | 368039 | 17.7% |
| 2 | 69310 | 3.3% |
| 3 | 22649 | 1.1% |
| 4 | 8403 | 0.4% |
| 5 | 3225 | 0.2% |
| 6 | 1350 | 0.1% |
| 7 | 574 | < 0.1% |
| 8 | 252 | < 0.1% |
| 9 | 129 | < 0.1% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 3 | |
| 23 | 1 | < 0.1% |
| 22 | 3 |
NUMBER OF PERSONS KILLED
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0014951363 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 2072415 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.040773863 |
|---|---|
| Coefficient of variation (CV) | 27.271 |
| Kurtosis | 1937.399 |
| Mean | 0.0014951363 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 33.717434 |
| Sum | 3103 |
| Variance | 0.0016625079 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2072415 | |
| 1 | 2889 | 0.1% |
| 2 | 74 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| (Missing) | 31 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2072415 | |
| 1 | 2889 | 0.1% |
| 2 | 74 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 3 | < 0.1% |
| 3 | 12 | < 0.1% |
| 2 | 74 | < 0.1% |
| 1 | 2889 | 0.1% |
| 0 | 2072415 |
NUMBER OF PEDESTRIANS INJURED
Real number (ℝ)
ZEROS 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.056549327 |
| Minimum | 0 |
|---|---|
| Maximum | 27 |
| Zeros | 1962919 |
| Zeros (%) | 94.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 27 |
| Range | 27 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2440835 |
|---|---|
| Coefficient of variation (CV) | 4.3162936 |
| Kurtosis | 129.0936 |
| Mean | 0.056549327 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.6862516 |
| Sum | 117364 |
| Variance | 0.059576754 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1962919 | |
| 1 | 108371 | 5.2% |
| 2 | 3663 | 0.2% |
| 3 | 365 | < 0.1% |
| 4 | 60 | < 0.1% |
| 5 | 26 | < 0.1% |
| 6 | 11 | < 0.1% |
| 7 | 4 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1962919 | |
| 1 | 108371 | 5.2% |
| 2 | 3663 | 0.2% |
| 3 | 365 | < 0.1% |
| 4 | 60 | < 0.1% |
| 5 | 26 | < 0.1% |
| 6 | 11 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 27 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| 7 | 4 | < 0.1% |
| 6 | 11 | < 0.1% |
| 5 | 26 | |
| 4 | 60 |
NUMBER OF PEDESTRIANS KILLED
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 MiB |
| 0 | |
|---|---|
| 1 | 1509 |
| 2 | 12 |
| 6 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2075427 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2073905 | |
| 1 | 1509 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2073905 | |
| 1 | 1509 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2073905 | |
| 1 | 1509 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2075427 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2073905 | |
| 1 | 1509 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2075427 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2073905 | |
| 1 | 1509 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2075427 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2073905 | |
| 1 | 1509 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
NUMBER OF CYCLIST INJURED
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 MiB |
| 0 | |
|---|---|
| 1 | 54340 |
| 2 | 600 |
| 3 | 23 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2075427 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2020463 | |
| 1 | 54340 | 2.6% |
| 2 | 600 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2020463 | |
| 1 | 54340 | 2.6% |
| 2 | 600 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2020463 | |
| 1 | 54340 | 2.6% |
| 2 | 600 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2075427 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2020463 | |
| 1 | 54340 | 2.6% |
| 2 | 600 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2075427 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2020463 | |
| 1 | 54340 | 2.6% |
| 2 | 600 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2075427 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2020463 | |
| 1 | 54340 | 2.6% |
| 2 | 600 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 1 | < 0.1% |
NUMBER OF CYCLIST KILLED
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.8 MiB |
| 0 | |
|---|---|
| 1 | 237 |
| 2 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2075427 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2075189 | |
| 1 | 237 | < 0.1% |
| 2 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2075189 | |
| 1 | 237 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2075189 | |
| 1 | 237 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2075427 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2075189 | |
| 1 | 237 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2075427 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2075189 | |
| 1 | 237 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2075427 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2075189 | |
| 1 | 237 | < 0.1% |
| 2 | 1 | < 0.1% |
NUMBER OF MOTORIST INJURED
Real number (ℝ)
ZEROS 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.22282162 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 1772939 |
| Zeros (%) | 85.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.66109218 |
|---|---|
| Coefficient of variation (CV) | 2.9669122 |
| Kurtosis | 63.717057 |
| Mean | 0.22282162 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.1266596 |
| Sum | 462450 |
| Variance | 0.43704287 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1772939 | |
| 1 | 203426 | 9.8% |
| 2 | 63230 | 3.0% |
| 3 | 21961 | 1.1% |
| 4 | 8230 | 0.4% |
| 5 | 3175 | 0.2% |
| 6 | 1304 | 0.1% |
| 7 | 548 | < 0.1% |
| 8 | 245 | < 0.1% |
| 9 | 123 | < 0.1% |
| Other values (21) | 246 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1772939 | |
| 1 | 203426 | 9.8% |
| 2 | 63230 | 3.0% |
| 3 | 21961 | 1.1% |
| 4 | 8230 | 0.4% |
| 5 | 3175 | 0.2% |
| 6 | 1304 | 0.1% |
| 7 | 548 | < 0.1% |
| 8 | 245 | < 0.1% |
| 9 | 123 | < 0.1% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 3 | |
| 23 | 1 | < 0.1% |
| 22 | 2 | |
| 21 | 1 | < 0.1% |
NUMBER OF MOTORIST KILLED
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.00061529507 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 2074246 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.027135542 |
|---|---|
| Coefficient of variation (CV) | 44.101673 |
| Kurtosis | 4230.0939 |
| Mean | 0.00061529507 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 54.744147 |
| Sum | 1277 |
| Variance | 0.00073633763 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2074246 | |
| 1 | 1107 | 0.1% |
| 2 | 58 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2074246 | |
| 1 | 1107 | 0.1% |
| 2 | 58 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 2 | < 0.1% |
| 4 | 2 | < 0.1% |
| 3 | 12 | < 0.1% |
| 2 | 58 | < 0.1% |
| 1 | 1107 | 0.1% |
| 0 | 2074246 |
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6802 |
| Missing (%) | 0.3% |
| Memory size | 15.8 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 43 |
| Mean length | 19.504495 |
| Min length | 1 |
Characters and Unicode
| Total characters | 40347485 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Aggressive Driving/Road Rage |
|---|---|
| 2nd row | Pavement Slippery |
| 3rd row | Following Too Closely |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 706732 | |
| driver | 447768 | 10.9% |
| inattention/distraction | 415252 | 10.1% |
| too | 162593 | 3.9% |
| closely | 162593 | 3.9% |
| to | 148089 | 3.6% |
| failure | 129495 | 3.1% |
| yield | 123304 | 3.0% |
| right-of-way | 123304 | 3.0% |
| following | 110930 | 2.7% |
| Other values (96) | 1591210 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4541258 | 11.3% |
| e | 4110099 | 10.2% |
| n | 3507152 | 8.7% |
| t | 2798284 | 6.9% |
| o | 2379399 | 5.9% |
| r | 2368411 | 5.9% |
| s | 2097469 | 5.2% |
| 2052645 | 5.1% | |
| a | 1989702 | 4.9% |
| c | 1555120 | 3.9% |
| Other values (45) | 12947946 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 32952997 | |
| Uppercase Letter | 4563483 | 11.3% |
| Space Separator | 2052645 | 5.1% |
| Other Punctuation | 525436 | 1.3% |
| Dash Punctuation | 248356 | 0.6% |
| Open Punctuation | 2178 | < 0.1% |
| Close Punctuation | 2178 | < 0.1% |
| Decimal Number | 212 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4541258 | |
| e | 4110099 | |
| n | 3507152 | |
| t | 2798284 | |
| o | 2379399 | 7.2% |
| r | 2368411 | 7.2% |
| s | 2097469 | 6.4% |
| a | 1989702 | 6.0% |
| c | 1555120 | 4.7% |
| l | 1247725 | 3.8% |
| Other values (15) | 6358378 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1008742 | |
| U | 932875 | |
| I | 589426 | |
| F | 296057 | 6.5% |
| C | 284991 | 6.2% |
| T | 254554 | 5.6% |
| P | 184921 | 4.1% |
| R | 169001 | 3.7% |
| L | 134045 | 2.9% |
| W | 124428 | 2.7% |
| Other values (12) | 584443 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 101 | |
| 0 | 101 | |
| 1 | 10 | 4.7% |
Space Separator
| Value | Count | Frequency (%) |
| 2052645 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 525436 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 248356 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2178 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2178 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37516480 | |
| Common | 2831005 | 7.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 4541258 | |
| e | 4110099 | 11.0% |
| n | 3507152 | 9.3% |
| t | 2798284 | 7.5% |
| o | 2379399 | 6.3% |
| r | 2368411 | 6.3% |
| s | 2097469 | 5.6% |
| a | 1989702 | 5.3% |
| c | 1555120 | 4.1% |
| l | 1247725 | 3.3% |
| Other values (37) | 10921861 |
Common
| Value | Count | Frequency (%) |
| 2052645 | ||
| / | 525436 | 18.6% |
| - | 248356 | 8.8% |
| ( | 2178 | 0.1% |
| ) | 2178 | 0.1% |
| 8 | 101 | < 0.1% |
| 0 | 101 | < 0.1% |
| 1 | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40347485 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 4541258 | 11.3% |
| e | 4110099 | 10.2% |
| n | 3507152 | 8.7% |
| t | 2798284 | 6.9% |
| o | 2379399 | 5.9% |
| r | 2368411 | 5.9% |
| s | 2097469 | 5.2% |
| 2052645 | 5.1% | |
| a | 1989702 | 4.9% |
| c | 1555120 | 3.9% |
| Other values (45) | 12947946 |
CONTRIBUTING FACTOR VEHICLE 2
Text
MISSING 
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 321736 |
| Missing (%) | 15.5% |
| Memory size | 15.8 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 13.048611 |
| Min length | 1 |
Characters and Unicode
| Total characters | 22883231 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 1476469 | |
| driver | 100961 | 4.7% |
| inattention/distraction | 94252 | 4.4% |
| other | 33129 | 1.5% |
| vehicular | 32066 | 1.5% |
| too | 27733 | 1.3% |
| closely | 27733 | 1.3% |
| passing | 21554 | 1.0% |
| to | 21532 | 1.0% |
| lane | 20107 | 0.9% |
| Other values (96) | 295716 | 13.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 3607436 | |
| e | 3511207 | |
| n | 2050908 | |
| s | 1757954 | |
| c | 1666318 | |
| d | 1549984 | |
| p | 1546225 | |
| f | 1532577 | |
| U | 1512982 | |
| t | 619191 | 2.7% |
| Other values (45) | 3528449 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20077874 | |
| Uppercase Letter | 2253317 | 9.8% |
| Space Separator | 397561 | 1.7% |
| Other Punctuation | 118972 | 0.5% |
| Dash Punctuation | 34874 | 0.2% |
| Open Punctuation | 292 | < 0.1% |
| Close Punctuation | 292 | < 0.1% |
| Decimal Number | 49 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 3607436 | |
| e | 3511207 | |
| n | 2050908 | |
| s | 1757954 | |
| c | 1666318 | |
| d | 1549984 | |
| p | 1546225 | |
| f | 1532577 | |
| t | 619191 | 3.1% |
| r | 540460 | 2.7% |
| Other values (15) | 1695614 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 1512982 | |
| D | 224332 | 10.0% |
| I | 126451 | 5.6% |
| C | 52660 | 2.3% |
| F | 48383 | 2.1% |
| T | 44565 | 2.0% |
| O | 44234 | 2.0% |
| V | 41362 | 1.8% |
| P | 37413 | 1.7% |
| L | 28576 | 1.3% |
| Other values (12) | 92359 | 4.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 22 | |
| 0 | 22 | |
| 1 | 5 | 10.2% |
Space Separator
| Value | Count | Frequency (%) |
| 397561 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 118972 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 34874 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 292 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 292 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22331191 | |
| Common | 552040 | 2.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 3607436 | |
| e | 3511207 | |
| n | 2050908 | |
| s | 1757954 | |
| c | 1666318 | |
| d | 1549984 | |
| p | 1546225 | |
| f | 1532577 | |
| U | 1512982 | |
| t | 619191 | 2.8% |
| Other values (37) | 2976409 |
Common
| Value | Count | Frequency (%) |
| 397561 | ||
| / | 118972 | 21.6% |
| - | 34874 | 6.3% |
| ( | 292 | 0.1% |
| ) | 292 | 0.1% |
| 8 | 22 | < 0.1% |
| 0 | 22 | < 0.1% |
| 1 | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22883231 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 3607436 | |
| e | 3511207 | |
| n | 2050908 | |
| s | 1757954 | |
| c | 1666318 | |
| d | 1549984 | |
| p | 1546225 | |
| f | 1532577 | |
| U | 1512982 | |
| t | 619191 | 2.7% |
| Other values (45) | 3528449 |
CONTRIBUTING FACTOR VEHICLE 3
Text
MISSING 
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1927163 |
| Missing (%) | 92.9% |
| Memory size | 15.8 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 11.656053 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1728173 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 138219 | |
| other | 2813 | 1.7% |
| vehicular | 2773 | 1.7% |
| driver | 2131 | 1.3% |
| too | 2011 | 1.2% |
| closely | 2011 | 1.2% |
| following | 1957 | 1.2% |
| inattention/distraction | 1950 | 1.2% |
| fatigued/drowsy | 853 | 0.5% |
| pavement | 410 | 0.3% |
| Other values (79) | 5908 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 295337 | |
| i | 294017 | |
| n | 151554 | |
| s | 145163 | |
| c | 144599 | |
| d | 140321 | |
| p | 139879 | |
| f | 139124 | |
| U | 138882 | |
| o | 17264 | 1.0% |
| Other values (45) | 122033 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1548193 | |
| Uppercase Letter | 163776 | 9.5% |
| Space Separator | 12772 | 0.7% |
| Other Punctuation | 3092 | 0.2% |
| Dash Punctuation | 309 | < 0.1% |
| Open Punctuation | 12 | < 0.1% |
| Close Punctuation | 12 | < 0.1% |
| Decimal Number | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 295337 | |
| i | 294017 | |
| n | 151554 | |
| s | 145163 | |
| c | 144599 | |
| d | 140321 | |
| p | 139879 | |
| f | 139124 | |
| o | 17264 | 1.1% |
| t | 16015 | 1.0% |
| Other values (15) | 64920 | 4.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 138882 | |
| D | 5536 | 3.4% |
| O | 3140 | 1.9% |
| V | 3060 | 1.9% |
| F | 3053 | 1.9% |
| C | 2492 | 1.5% |
| I | 2472 | 1.5% |
| T | 2268 | 1.4% |
| P | 703 | 0.4% |
| S | 561 | 0.3% |
| Other values (12) | 1609 | 1.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 3 | |
| 0 | 3 | |
| 1 | 1 | 14.3% |
Space Separator
| Value | Count | Frequency (%) |
| 12772 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3092 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 309 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1711969 | |
| Common | 16204 | 0.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 295337 | |
| i | 294017 | |
| n | 151554 | |
| s | 145163 | |
| c | 144599 | |
| d | 140321 | |
| p | 139879 | |
| f | 139124 | |
| U | 138882 | |
| o | 17264 | 1.0% |
| Other values (37) | 105829 | 6.2% |
Common
| Value | Count | Frequency (%) |
| 12772 | ||
| / | 3092 | 19.1% |
| - | 309 | 1.9% |
| ( | 12 | 0.1% |
| ) | 12 | 0.1% |
| 8 | 3 | < 0.1% |
| 0 | 3 | < 0.1% |
| 1 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1728173 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 295337 | |
| i | 294017 | |
| n | 151554 | |
| s | 145163 | |
| c | 144599 | |
| d | 140321 | |
| p | 139879 | |
| f | 139124 | |
| U | 138882 | |
| o | 17264 | 1.0% |
| Other values (45) | 122033 |
CONTRIBUTING FACTOR VEHICLE 4
Categorical
IMBALANCE  MISSING 
| Distinct | 41 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2041953 |
| Missing (%) | 98.4% |
| Memory size | 15.8 MiB |
| Unspecified | |
|---|---|
| Other Vehicular | 614 |
| Following Too Closely | 390 |
| Driver Inattention/Distraction | 275 |
| Fatigued/Drowsy | 170 |
| Other values (36) | 448 |
Length
| Max length | 43 |
|---|---|
| Median length | 11 |
| Mean length | 11.489425 |
| Min length | 5 |
Characters and Unicode
| Total characters | 384597 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
Common Values
| Value | Count | Frequency (%) |
| Unspecified | 31577 | 1.5% |
| Other Vehicular | 614 | < 0.1% |
| Following Too Closely | 390 | < 0.1% |
| Driver Inattention/Distraction | 275 | < 0.1% |
| Fatigued/Drowsy | 170 | < 0.1% |
| Pavement Slippery | 116 | < 0.1% |
| Reaction to Uninvolved Vehicle | 41 | < 0.1% |
| Unsafe Speed | 32 | < 0.1% |
| Outside Car Distraction | 28 | < 0.1% |
| Driver Inexperience | 27 | < 0.1% |
| Other values (31) | 204 | < 0.1% |
| (Missing) | 2041953 |
Length
| Value | Count | Frequency (%) |
| unspecified | 31577 | |
| other | 623 | 1.7% |
| vehicular | 614 | 1.7% |
| too | 395 | 1.1% |
| closely | 395 | 1.1% |
| following | 390 | 1.1% |
| driver | 302 | 0.8% |
| inattention/distraction | 275 | 0.8% |
| fatigued/drowsy | 170 | 0.5% |
| pavement | 119 | 0.3% |
| Other values (64) | 965 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 66721 | |
| i | 66107 | |
| n | 33651 | |
| c | 32739 | |
| s | 32723 | |
| p | 31939 | |
| d | 31931 | |
| f | 31705 | |
| U | 31684 | |
| o | 3077 | 0.8% |
| Other values (41) | 22320 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 345478 | |
| Uppercase Letter | 36244 | 9.4% |
| Space Separator | 2351 | 0.6% |
| Other Punctuation | 482 | 0.1% |
| Dash Punctuation | 34 | < 0.1% |
| Open Punctuation | 4 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 66721 | |
| i | 66107 | |
| n | 33651 | |
| c | 32739 | |
| s | 32723 | |
| p | 31939 | |
| d | 31931 | |
| f | 31705 | |
| o | 3077 | 0.9% |
| r | 2766 | 0.8% |
| Other values (15) | 12119 | 3.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 31684 | |
| D | 858 | 2.4% |
| O | 677 | 1.9% |
| V | 661 | 1.8% |
| F | 604 | 1.7% |
| C | 460 | 1.3% |
| T | 425 | 1.2% |
| I | 349 | 1.0% |
| S | 149 | 0.4% |
| P | 145 | 0.4% |
| Other values (11) | 232 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 2351 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 482 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 34 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 381722 | |
| Common | 2875 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 66721 | |
| i | 66107 | |
| n | 33651 | |
| c | 32739 | |
| s | 32723 | |
| p | 31939 | |
| d | 31931 | |
| f | 31705 | |
| U | 31684 | |
| o | 3077 | 0.8% |
| Other values (36) | 19445 | 5.1% |
Common
| Value | Count | Frequency (%) |
| 2351 | ||
| / | 482 | 16.8% |
| - | 34 | 1.2% |
| ( | 4 | 0.1% |
| ) | 4 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 384597 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 66721 | |
| i | 66107 | |
| n | 33651 | |
| c | 32739 | |
| s | 32723 | |
| p | 31939 | |
| d | 31931 | |
| f | 31705 | |
| U | 31684 | |
| o | 3077 | 0.8% |
| Other values (41) | 22320 | 5.8% |
CONTRIBUTING FACTOR VEHICLE 5
Categorical
IMBALANCE  MISSING 
| Distinct | 30 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2066358 |
| Missing (%) | 99.6% |
| Memory size | 15.8 MiB |
| Unspecified | |
|---|---|
| Other Vehicular | 178 |
| Following Too Closely | 98 |
| Driver Inattention/Distraction | 64 |
| Pavement Slippery | 49 |
| Other values (25) | 131 |
Length
| Max length | 43 |
|---|---|
| Median length | 11 |
| Mean length | 11.468078 |
| Min length | 5 |
Characters and Unicode
| Total characters | 104004 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
Common Values
| Value | Count | Frequency (%) |
| Unspecified | 8549 | 0.4% |
| Other Vehicular | 178 | < 0.1% |
| Following Too Closely | 98 | < 0.1% |
| Driver Inattention/Distraction | 64 | < 0.1% |
| Pavement Slippery | 49 | < 0.1% |
| Fatigued/Drowsy | 41 | < 0.1% |
| Reaction to Uninvolved Vehicle | 12 | < 0.1% |
| Alcohol Involvement | 11 | < 0.1% |
| Obstruction/Debris | 10 | < 0.1% |
| Driver Inexperience | 10 | < 0.1% |
| Other values (20) | 47 | < 0.1% |
| (Missing) | 2066358 |
Length
| Value | Count | Frequency (%) |
| unspecified | 8549 | |
| other | 180 | 1.9% |
| vehicular | 178 | 1.8% |
| too | 100 | 1.0% |
| closely | 100 | 1.0% |
| following | 98 | 1.0% |
| driver | 74 | 0.8% |
| inattention/distraction | 64 | 0.7% |
| pavement | 50 | 0.5% |
| slippery | 49 | 0.5% |
| Other values (47) | 251 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 18109 | |
| i | 17868 | |
| n | 9076 | |
| c | 8869 | |
| s | 8820 | |
| p | 8675 | |
| d | 8634 | |
| f | 8576 | |
| U | 8572 | |
| o | 781 | 0.8% |
| Other values (40) | 6024 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 93452 | |
| Uppercase Letter | 9795 | 9.4% |
| Space Separator | 624 | 0.6% |
| Other Punctuation | 118 | 0.1% |
| Dash Punctuation | 11 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 18109 | |
| i | 17868 | |
| n | 9076 | |
| c | 8869 | |
| s | 8820 | |
| p | 8675 | |
| d | 8634 | |
| f | 8576 | |
| o | 781 | 0.8% |
| r | 748 | 0.8% |
| Other values (15) | 3296 | 3.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 8572 | |
| D | 208 | 2.1% |
| O | 198 | 2.0% |
| V | 191 | 1.9% |
| F | 151 | 1.5% |
| C | 112 | 1.1% |
| T | 106 | 1.1% |
| I | 89 | 0.9% |
| S | 59 | 0.6% |
| P | 53 | 0.5% |
| Other values (10) | 56 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 624 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 118 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 103247 | |
| Common | 757 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 18109 | |
| i | 17868 | |
| n | 9076 | |
| c | 8869 | |
| s | 8820 | |
| p | 8675 | |
| d | 8634 | |
| f | 8576 | |
| U | 8572 | |
| o | 781 | 0.8% |
| Other values (35) | 5267 | 5.1% |
Common
| Value | Count | Frequency (%) |
| 624 | ||
| / | 118 | 15.6% |
| - | 11 | 1.5% |
| ( | 2 | 0.3% |
| ) | 2 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104004 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 18109 | |
| i | 17868 | |
| n | 9076 | |
| c | 8869 | |
| s | 8820 | |
| p | 8675 | |
| d | 8634 | |
| f | 8576 | |
| U | 8572 | |
| o | 781 | 0.8% |
| Other values (40) | 6024 | 5.8% |
COLLISION_ID
Real number (ℝ)
UNIQUE 
| Distinct | 2075427 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3159627 |
| Minimum | 22 |
|---|---|
| Maximum | 4712252 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.8 MiB |
Quantile statistics
| Minimum | 22 |
|---|---|
| 5-th percentile | 104625.3 |
| Q1 | 3154976.5 |
| median | 3673954 |
| Q3 | 4193057.5 |
| 95-th percentile | 4608219.7 |
| Maximum | 4712252 |
| Range | 4712230 |
| Interquartile range (IQR) | 1038081 |
Descriptive statistics
| Standard deviation | 1505149.9 |
|---|---|
| Coefficient of variation (CV) | 0.47636949 |
| Kurtosis | -0.032800807 |
| Mean | 3159627 |
| Median Absolute Deviation (MAD) | 519041 |
| Skewness | -1.2236319 |
| Sum | 6.5575751 × 1012 |
| Variance | 2.2654762 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4455765 | 1 | < 0.1% |
| 3176288 | 1 | < 0.1% |
| 3188747 | 1 | < 0.1% |
| 3176436 | 1 | < 0.1% |
| 3189909 | 1 | < 0.1% |
| 3187402 | 1 | < 0.1% |
| 3178392 | 1 | < 0.1% |
| 3183441 | 1 | < 0.1% |
| 3178566 | 1 | < 0.1% |
| 3185340 | 1 | < 0.1% |
| Other values (2075417) | 2075417 |
| Value | Count | Frequency (%) |
| 22 | 1 | |
| 23 | 1 | |
| 24 | 1 | |
| 25 | 1 | |
| 26 | 1 | |
| 27 | 1 | |
| 28 | 1 | |
| 29 | 1 | |
| 30 | 1 | |
| 31 | 1 |
| Value | Count | Frequency (%) |
| 4712252 | 1 | |
| 4712247 | 1 | |
| 4712246 | 1 | |
| 4712245 | 1 | |
| 4712242 | 1 | |
| 4712241 | 1 | |
| 4712237 | 1 | |
| 4712235 | 1 | |
| 4712232 | 1 | |
| 4712231 | 1 |
| Distinct | 1631 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 13691 |
| Missing (%) | 0.7% |
| Memory size | 15.8 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 35 |
| Mean length | 16.886453 |
| Min length | 1 |
Characters and Unicode
| Total characters | 34815408 |
|---|---|
| Distinct characters | 75 |
| Distinct categories | 11 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 989 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Sedan |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Dump |
| Value | Count | Frequency (%) |
| vehicle | 880306 | |
| utility | 633851 | |
| station | 633808 | |
| sedan | 619493 | |
| wagon/sport | 453517 | |
| passenger | 416219 | |
| 181665 | 3.7% | |
| wagon | 180354 | 3.7% |
| sport | 180291 | 3.7% |
| truck | 85920 | 1.8% |
| Other values (950) | 616060 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2832968 | 8.1% | |
| S | 2735641 | 7.9% |
| t | 2300987 | 6.6% |
| i | 1938153 | 5.6% |
| E | 1818931 | 5.2% |
| a | 1620452 | 4.7% |
| e | 1611200 | 4.6% |
| n | 1548461 | 4.4% |
| o | 1436044 | 4.1% |
| T | 1141718 | 3.3% |
| Other values (65) | 15830853 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15625515 | |
| Uppercase Letter | 15543247 | |
| Space Separator | 2832968 | 8.1% |
| Other Punctuation | 635237 | 1.8% |
| Decimal Number | 71018 | 0.2% |
| Dash Punctuation | 52188 | 0.1% |
| Open Punctuation | 27618 | 0.1% |
| Close Punctuation | 27613 | 0.1% |
| Modifier Symbol | 2 | < 0.1% |
| Other Symbol | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2735641 | |
| E | 1818931 | |
| T | 1141718 | 7.3% |
| I | 1052103 | 6.8% |
| V | 953509 | 6.1% |
| A | 875488 | 5.6% |
| N | 865396 | 5.6% |
| R | 723751 | 4.7% |
| U | 695980 | 4.5% |
| L | 667664 | 4.3% |
| Other values (16) | 4013066 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2300987 | |
| i | 1938153 | |
| a | 1620452 | |
| e | 1611200 | |
| n | 1548461 | |
| o | 1436044 | |
| l | 943636 | |
| d | 667892 | 4.3% |
| r | 626054 | 4.0% |
| c | 600619 | 3.8% |
| Other values (15) | 2332017 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 53411 | |
| 6 | 14403 | 20.3% |
| 2 | 2678 | 3.8% |
| 3 | 340 | 0.5% |
| 1 | 64 | 0.1% |
| 5 | 47 | 0.1% |
| 0 | 38 | 0.1% |
| 9 | 20 | < 0.1% |
| 8 | 10 | < 0.1% |
| 7 | 7 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 635209 | |
| . | 13 | < 0.1% |
| # | 8 | < 0.1% |
| , | 3 | < 0.1% |
| ' | 2 | < 0.1% |
| & | 1 | < 0.1% |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2832968 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 52188 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 27618 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 27613 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 |
Other Symbol
| Value | Count | Frequency (%) |
| � | 1 |
Control
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31168762 | |
| Common | 3646646 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2735641 | 8.8% |
| t | 2300987 | 7.4% |
| i | 1938153 | 6.2% |
| E | 1818931 | 5.8% |
| a | 1620452 | 5.2% |
| e | 1611200 | 5.2% |
| n | 1548461 | 5.0% |
| o | 1436044 | 4.6% |
| T | 1141718 | 3.7% |
| I | 1052103 | 3.4% |
| Other values (41) | 13965072 |
Common
| Value | Count | Frequency (%) |
| 2832968 | ||
| / | 635209 | 17.4% |
| 4 | 53411 | 1.5% |
| - | 52188 | 1.4% |
| ( | 27618 | 0.8% |
| ) | 27613 | 0.8% |
| 6 | 14403 | 0.4% |
| 2 | 2678 | 0.1% |
| 3 | 340 | < 0.1% |
| 1 | 64 | < 0.1% |
| Other values (14) | 154 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34815407 | |
| Specials | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2832968 | 8.1% | |
| S | 2735641 | 7.9% |
| t | 2300987 | 6.6% |
| i | 1938153 | 5.6% |
| E | 1818931 | 5.2% |
| a | 1620452 | 4.7% |
| e | 1611200 | 4.6% |
| n | 1548461 | 4.4% |
| o | 1436044 | 4.1% |
| T | 1141718 | 3.3% |
| Other values (64) | 15830852 |
Specials
| Value | Count | Frequency (%) |
| � | 1 |
MISSING 
| Distinct | 1819 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 396691 |
| Missing (%) | 19.1% |
| Memory size | 15.8 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 30 |
| Mean length | 16.08444 |
| Min length | 1 |
Characters and Unicode
| Total characters | 27001529 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1080 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Pick-up Truck |
| 3rd row | Sedan |
| 4th row | Tractor Truck Diesel |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| vehicle | 653746 | |
| utility | 466778 | |
| station | 466750 | |
| sedan | 435556 | |
| wagon/sport | 326546 | |
| passenger | 318612 | |
| 141501 | 3.7% | |
| wagon | 140256 | 3.7% |
| sport | 140204 | 3.7% |
| truck | 85272 | 2.2% |
| Other values (1009) | 655810 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2165263 | 8.0% | |
| S | 2031182 | 7.5% |
| t | 1665937 | 6.2% |
| E | 1438671 | 5.3% |
| i | 1431599 | 5.3% |
| e | 1189958 | 4.4% |
| a | 1165845 | 4.3% |
| n | 1107454 | 4.1% |
| o | 1060004 | 3.9% |
| T | 919371 | 3.4% |
| Other values (63) | 12826245 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12665972 | |
| Lowercase Letter | 11537150 | |
| Space Separator | 2165263 | 8.0% |
| Other Punctuation | 468122 | 1.7% |
| Decimal Number | 59185 | 0.2% |
| Dash Punctuation | 52534 | 0.2% |
| Open Punctuation | 26652 | 0.1% |
| Close Punctuation | 26649 | 0.1% |
| Modifier Symbol | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2031182 | |
| E | 1438671 | |
| T | 919371 | 7.3% |
| N | 869351 | 6.9% |
| I | 842146 | 6.6% |
| V | 720985 | 5.7% |
| A | 685486 | 5.4% |
| O | 587991 | 4.6% |
| U | 585305 | 4.6% |
| R | 578068 | 4.6% |
| Other values (16) | 3407416 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1665937 | |
| i | 1431599 | |
| e | 1189958 | |
| a | 1165845 | |
| n | 1107454 | |
| o | 1060004 | |
| l | 685508 | 5.9% |
| r | 488148 | 4.2% |
| d | 474301 | 4.1% |
| c | 467870 | 4.1% |
| Other values (15) | 1800526 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 43069 | |
| 6 | 13695 | 23.1% |
| 2 | 1960 | 3.3% |
| 3 | 307 | 0.5% |
| 0 | 57 | 0.1% |
| 1 | 47 | 0.1% |
| 5 | 30 | 0.1% |
| 9 | 8 | < 0.1% |
| 8 | 7 | < 0.1% |
| 7 | 5 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 468100 | |
| . | 11 | < 0.1% |
| ' | 3 | < 0.1% |
| , | 3 | < 0.1% |
| ? | 2 | < 0.1% |
| # | 2 | < 0.1% |
| & | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2165263 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 52534 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 26652 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 26649 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24203122 | |
| Common | 2798407 | 10.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 2031182 | 8.4% |
| t | 1665937 | 6.9% |
| E | 1438671 | 5.9% |
| i | 1431599 | 5.9% |
| e | 1189958 | 4.9% |
| a | 1165845 | 4.8% |
| n | 1107454 | 4.6% |
| o | 1060004 | 4.4% |
| T | 919371 | 3.8% |
| N | 869351 | 3.6% |
| Other values (41) | 11323750 |
Common
| Value | Count | Frequency (%) |
| 2165263 | ||
| / | 468100 | 16.7% |
| - | 52534 | 1.9% |
| 4 | 43069 | 1.5% |
| ( | 26652 | 1.0% |
| ) | 26649 | 1.0% |
| 6 | 13695 | 0.5% |
| 2 | 1960 | 0.1% |
| 3 | 307 | < 0.1% |
| 0 | 57 | < 0.1% |
| Other values (12) | 121 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27001529 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2165263 | 8.0% | |
| S | 2031182 | 7.5% |
| t | 1665937 | 6.2% |
| E | 1438671 | 5.3% |
| i | 1431599 | 5.3% |
| e | 1189958 | 4.4% |
| a | 1165845 | 4.3% |
| n | 1107454 | 4.1% |
| o | 1060004 | 3.9% |
| T | 919371 | 3.4% |
| Other values (63) | 12826245 |
MISSING 
| Distinct | 260 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1932530 |
| Missing (%) | 93.1% |
| Memory size | 15.8 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 17.679552 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2526355 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 152 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Station Wagon/Sport Utility Vehicle |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| vehicle | 64246 | |
| utility | 49457 | |
| station | 49455 | |
| sedan | 47158 | |
| wagon/sport | 36096 | |
| passenger | 27716 | |
| 13439 | 3.9% | |
| wagon | 13359 | 3.8% |
| sport | 13358 | 3.8% |
| truck | 4339 | 1.3% |
| Other values (216) | 28474 |
Most occurring characters
| Value | Count | Frequency (%) |
| 204635 | 8.1% | |
| S | 200575 | 7.9% |
| t | 181889 | 7.2% |
| i | 150270 | 5.9% |
| a | 122930 | 4.9% |
| e | 122469 | 4.8% |
| n | 120231 | 4.8% |
| E | 116403 | 4.6% |
| o | 111274 | 4.4% |
| T | 77028 | 3.0% |
| Other values (52) | 1118651 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1190232 | |
| Uppercase Letter | 1073474 | |
| Space Separator | 204635 | 8.1% |
| Other Punctuation | 49536 | 2.0% |
| Decimal Number | 3643 | 0.1% |
| Dash Punctuation | 3083 | 0.1% |
| Open Punctuation | 876 | < 0.1% |
| Close Punctuation | 876 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 200575 | |
| E | 116403 | |
| T | 77028 | 7.2% |
| I | 71404 | 6.7% |
| V | 67340 | 6.3% |
| N | 65717 | 6.1% |
| A | 57929 | 5.4% |
| U | 54497 | 5.1% |
| W | 52846 | 4.9% |
| O | 46586 | 4.3% |
| Other values (15) | 263149 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 181889 | |
| i | 150270 | |
| a | 122930 | |
| e | 122469 | |
| n | 120231 | |
| o | 111274 | |
| l | 73585 | |
| d | 50123 | 4.2% |
| r | 44674 | 3.8% |
| c | 43583 | 3.7% |
| Other values (14) | 169204 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2999 | |
| 6 | 442 | 12.1% |
| 2 | 185 | 5.1% |
| 3 | 11 | 0.3% |
| 1 | 2 | 0.1% |
| 8 | 2 | 0.1% |
| 5 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 204635 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 49536 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3083 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 876 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 876 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2263706 | |
| Common | 262649 | 10.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 200575 | 8.9% |
| t | 181889 | 8.0% |
| i | 150270 | 6.6% |
| a | 122930 | 5.4% |
| e | 122469 | 5.4% |
| n | 120231 | 5.3% |
| E | 116403 | 5.1% |
| o | 111274 | 4.9% |
| T | 77028 | 3.4% |
| l | 73585 | 3.3% |
| Other values (39) | 987052 |
Common
| Value | Count | Frequency (%) |
| 204635 | ||
| / | 49536 | 18.9% |
| - | 3083 | 1.2% |
| 4 | 2999 | 1.1% |
| ( | 876 | 0.3% |
| ) | 876 | 0.3% |
| 6 | 442 | 0.2% |
| 2 | 185 | 0.1% |
| 3 | 11 | < 0.1% |
| 1 | 2 | < 0.1% |
| Other values (3) | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2526355 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 204635 | 8.1% | |
| S | 200575 | 7.9% |
| t | 181889 | 7.2% |
| i | 150270 | 5.9% |
| a | 122930 | 4.9% |
| e | 122469 | 4.8% |
| n | 120231 | 4.8% |
| E | 116403 | 4.6% |
| o | 111274 | 4.4% |
| T | 77028 | 3.0% |
| Other values (52) | 1118651 |
MISSING 
| Distinct | 101 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2043115 |
| Missing (%) | 98.4% |
| Memory size | 15.8 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 17.97682 |
| Min length | 2 |
Characters and Unicode
| Total characters | 580867 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 45 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Station Wagon/Sport Utility Vehicle |
|---|---|
| 2nd row | Sedan |
| 3rd row | Station Wagon/Sport Utility Vehicle |
| 4th row | Sedan |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| vehicle | 14893 | |
| utility | 11719 | |
| station | 11719 | |
| sedan | 11398 | |
| wagon/sport | 8867 | |
| passenger | 5970 | |
| 2859 | 3.6% | |
| sport | 2852 | 3.6% |
| wagon | 2852 | 3.6% |
| truck | 798 | 1.0% |
| Other values (103) | 5046 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 46717 | 8.0% | |
| S | 46409 | 8.0% |
| t | 44549 | 7.7% |
| i | 36568 | 6.3% |
| a | 29793 | 5.1% |
| e | 29584 | 5.1% |
| n | 29274 | 5.0% |
| o | 27071 | 4.7% |
| E | 24669 | 4.2% |
| l | 17966 | 3.1% |
| Other values (47) | 248267 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 287894 | |
| Uppercase Letter | 232942 | |
| Space Separator | 46717 | 8.0% |
| Other Punctuation | 11726 | 2.0% |
| Decimal Number | 727 | 0.1% |
| Dash Punctuation | 633 | 0.1% |
| Open Punctuation | 114 | < 0.1% |
| Close Punctuation | 114 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 46409 | |
| E | 24669 | |
| T | 16064 | 6.9% |
| V | 15380 | 6.6% |
| I | 15047 | 6.5% |
| N | 13718 | 5.9% |
| U | 12610 | 5.4% |
| W | 12327 | 5.3% |
| A | 12215 | 5.2% |
| O | 9650 | 4.1% |
| Other values (14) | 54853 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 44549 | |
| i | 36568 | |
| a | 29793 | |
| e | 29584 | |
| n | 29274 | |
| o | 27071 | |
| l | 17966 | |
| d | 12038 | 4.2% |
| r | 10503 | 3.6% |
| c | 10277 | 3.6% |
| Other values (13) | 40271 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 624 | |
| 6 | 58 | 8.0% |
| 2 | 42 | 5.8% |
| 3 | 2 | 0.3% |
| 5 | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 46717 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 11726 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 633 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 114 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 114 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 520836 | |
| Common | 60031 | 10.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 46409 | 8.9% |
| t | 44549 | 8.6% |
| i | 36568 | 7.0% |
| a | 29793 | 5.7% |
| e | 29584 | 5.7% |
| n | 29274 | 5.6% |
| o | 27071 | 5.2% |
| E | 24669 | 4.7% |
| l | 17966 | 3.4% |
| T | 16064 | 3.1% |
| Other values (37) | 218889 |
Common
| Value | Count | Frequency (%) |
| 46717 | ||
| / | 11726 | 19.5% |
| - | 633 | 1.1% |
| 4 | 624 | 1.0% |
| ( | 114 | 0.2% |
| ) | 114 | 0.2% |
| 6 | 58 | 0.1% |
| 2 | 42 | 0.1% |
| 3 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 580867 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 46717 | 8.0% | |
| S | 46409 | 8.0% |
| t | 44549 | 7.7% |
| i | 36568 | 6.3% |
| a | 29793 | 5.1% |
| e | 29584 | 5.1% |
| n | 29274 | 5.0% |
| o | 27071 | 4.7% |
| E | 24669 | 4.2% |
| l | 17966 | 3.1% |
| Other values (47) | 248267 |
MISSING 
| Distinct | 70 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 2066635 |
| Missing (%) | 99.6% |
| Memory size | 15.8 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 18.214058 |
| Min length | 2 |
Characters and Unicode
| Total characters | 160138 |
|---|---|
| Distinct characters | 54 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 31 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Station Wagon/Sport Utility Vehicle |
|---|---|
| 2nd row | Station Wagon/Sport Utility Vehicle |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Station Wagon/Sport Utility Vehicle |
| Value | Count | Frequency (%) |
| vehicle | 4020 | |
| utility | 3326 | |
| station | 3326 | |
| sedan | 3182 | |
| wagon/sport | 2524 | |
| passenger | 1487 | 6.8% |
| 804 | 3.7% | |
| wagon | 804 | 3.7% |
| sport | 802 | 3.7% |
| truck | 245 | 1.1% |
| Other values (68) | 1196 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 12934 | 8.1% | |
| S | 12722 | 7.9% |
| t | 12689 | 7.9% |
| i | 10410 | 6.5% |
| a | 8409 | 5.3% |
| e | 8354 | 5.2% |
| n | 8289 | 5.2% |
| o | 7724 | 4.8% |
| E | 6129 | 3.8% |
| l | 5114 | 3.2% |
| Other values (44) | 67364 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 81706 | |
| Uppercase Letter | 61773 | |
| Space Separator | 12934 | 8.1% |
| Other Punctuation | 3328 | 2.1% |
| Dash Punctuation | 190 | 0.1% |
| Decimal Number | 161 | 0.1% |
| Open Punctuation | 23 | < 0.1% |
| Close Punctuation | 23 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 12722 | |
| E | 6129 | |
| T | 4507 | 7.3% |
| V | 4133 | 6.7% |
| I | 4008 | 6.5% |
| U | 3497 | 5.7% |
| N | 3429 | 5.6% |
| W | 3426 | 5.5% |
| A | 3211 | 5.2% |
| O | 2625 | 4.2% |
| Other values (13) | 14086 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 12689 | |
| i | 10410 | |
| a | 8409 | |
| e | 8354 | |
| n | 8289 | |
| o | 7724 | |
| l | 5114 | |
| d | 3328 | 4.1% |
| r | 2972 | 3.6% |
| c | 2965 | 3.6% |
| Other values (12) | 11452 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 133 | |
| 2 | 14 | 8.7% |
| 6 | 13 | 8.1% |
| 3 | 1 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| 12934 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3328 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 190 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 23 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 23 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 143479 | |
| Common | 16659 | 10.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 12722 | 8.9% |
| t | 12689 | 8.8% |
| i | 10410 | 7.3% |
| a | 8409 | 5.9% |
| e | 8354 | 5.8% |
| n | 8289 | 5.8% |
| o | 7724 | 5.4% |
| E | 6129 | 4.3% |
| l | 5114 | 3.6% |
| T | 4507 | 3.1% |
| Other values (35) | 59132 |
Common
| Value | Count | Frequency (%) |
| 12934 | ||
| / | 3328 | 20.0% |
| - | 190 | 1.1% |
| 4 | 133 | 0.8% |
| ( | 23 | 0.1% |
| ) | 23 | 0.1% |
| 2 | 14 | 0.1% |
| 6 | 13 | 0.1% |
| 3 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 160138 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 12934 | 8.1% | |
| S | 12722 | 7.9% |
| t | 12689 | 7.9% |
| i | 10410 | 6.5% |
| a | 8409 | 5.3% |
| e | 8354 | 5.2% |
| n | 8289 | 5.2% |
| o | 7724 | 4.8% |
| E | 6129 | 3.8% |
| l | 5114 | 3.2% |
| Other values (44) | 67364 |
| CRASH DATE | CRASH TIME | BOROUGH | ZIP CODE | LATITUDE | LONGITUDE | LOCATION | ON STREET NAME | CROSS STREET NAME | OFF STREET NAME | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | CONTRIBUTING FACTOR VEHICLE 1 | CONTRIBUTING FACTOR VEHICLE 2 | CONTRIBUTING FACTOR VEHICLE 3 | CONTRIBUTING FACTOR VEHICLE 4 | CONTRIBUTING FACTOR VEHICLE 5 | COLLISION_ID | VEHICLE TYPE CODE 1 | VEHICLE TYPE CODE 2 | VEHICLE TYPE CODE 3 | VEHICLE TYPE CODE 4 | VEHICLE TYPE CODE 5 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 09/11/2021 | 2:39 | NaN | NaN | NaN | NaN | NaN | WHITESTONE EXPRESSWAY | 20 AVENUE | NaN | 2.0 | 0.0 | 0 | 0 | 0 | 0 | 2 | 0 | Aggressive Driving/Road Rage | Unspecified | NaN | NaN | NaN | 4455765 | Sedan | Sedan | NaN | NaN | NaN |
| 1 | 03/26/2022 | 11:45 | NaN | NaN | NaN | NaN | NaN | QUEENSBORO BRIDGE UPPER | NaN | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Pavement Slippery | NaN | NaN | NaN | NaN | 4513547 | Sedan | NaN | NaN | NaN | NaN |
| 2 | 06/29/2022 | 6:55 | NaN | NaN | NaN | NaN | NaN | THROGS NECK BRIDGE | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Following Too Closely | Unspecified | NaN | NaN | NaN | 4541903 | Sedan | Pick-up Truck | NaN | NaN | NaN |
| 3 | 09/11/2021 | 9:35 | BROOKLYN | 11208 | 40.667202 | -73.866500 | (40.667202, -73.8665) | NaN | NaN | 1211 LORING AVENUE | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | NaN | NaN | NaN | NaN | 4456314 | Sedan | NaN | NaN | NaN | NaN |
| 4 | 12/14/2021 | 8:13 | BROOKLYN | 11233 | 40.683304 | -73.917274 | (40.683304, -73.917274) | SARATOGA AVENUE | DECATUR STREET | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | 4486609 | NaN | NaN | NaN | NaN | NaN |
| 5 | 04/14/2021 | 12:47 | NaN | NaN | NaN | NaN | NaN | MAJOR DEEGAN EXPRESSWAY RAMP | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | Unspecified | NaN | NaN | NaN | 4407458 | Dump | Sedan | NaN | NaN | NaN |
| 6 | 12/14/2021 | 17:05 | NaN | NaN | 40.709183 | -73.956825 | (40.709183, -73.956825) | BROOKLYN QUEENS EXPRESSWAY | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing Too Closely | Unspecified | NaN | NaN | NaN | 4486555 | Sedan | Tractor Truck Diesel | NaN | NaN | NaN |
| 7 | 12/14/2021 | 8:17 | BRONX | 10475 | 40.868160 | -73.831480 | (40.86816, -73.83148) | NaN | NaN | 344 BAYCHESTER AVENUE | 2.0 | 0.0 | 0 | 0 | 0 | 0 | 2 | 0 | Unspecified | Unspecified | NaN | NaN | NaN | 4486660 | Sedan | Sedan | NaN | NaN | NaN |
| 8 | 12/14/2021 | 21:10 | BROOKLYN | 11207 | 40.671720 | -73.897100 | (40.67172, -73.8971) | NaN | NaN | 2047 PITKIN AVENUE | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inexperience | Unspecified | NaN | NaN | NaN | 4487074 | Sedan | NaN | NaN | NaN | NaN |
| 9 | 12/14/2021 | 14:58 | MANHATTAN | 10017 | 40.751440 | -73.973970 | (40.75144, -73.97397) | 3 AVENUE | EAST 43 STREET | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing Too Closely | Unspecified | NaN | NaN | NaN | 4486519 | Sedan | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| CRASH DATE | CRASH TIME | BOROUGH | ZIP CODE | LATITUDE | LONGITUDE | LOCATION | ON STREET NAME | CROSS STREET NAME | OFF STREET NAME | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | CONTRIBUTING FACTOR VEHICLE 1 | CONTRIBUTING FACTOR VEHICLE 2 | CONTRIBUTING FACTOR VEHICLE 3 | CONTRIBUTING FACTOR VEHICLE 4 | CONTRIBUTING FACTOR VEHICLE 5 | COLLISION_ID | VEHICLE TYPE CODE 1 | VEHICLE TYPE CODE 2 | VEHICLE TYPE CODE 3 | VEHICLE TYPE CODE 4 | VEHICLE TYPE CODE 5 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2075417 | 03/05/2024 | 20:40 | QUEENS | 11375 | 40.722622 | -73.849144 | (40.722622, -73.849144) | YELLOWSTONE BOULEVARD | GERARD PLACE | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | Unspecified | NaN | NaN | NaN | 4707384 | Sedan | Tractor Truck Diesel | NaN | NaN | NaN |
| 2075418 | 03/05/2024 | 7:30 | NaN | NaN | 40.772953 | -73.920280 | (40.772953, -73.92028) | 26 STREET | HOYT AVENUE NORTH | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Turning Improperly | Driver Inattention/Distraction | NaN | NaN | NaN | 4707737 | Box Truck | Garbage or Refuse | NaN | NaN | NaN |
| 2075419 | 03/05/2024 | 14:50 | NaN | NaN | 40.646000 | -73.971750 | (40.646, -73.97175) | CHURCH AVENUE | EAST 8 STREET | NaN | 2.0 | 0.0 | 2 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | 4707432 | NaN | NaN | NaN | NaN | NaN |
| 2075420 | 03/05/2024 | 14:00 | NaN | NaN | 40.722250 | -74.005920 | (40.72225, -74.00592) | CANAL STREET | AVENUE OF THE AMERICAS | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Following Too Closely | Following Too Closely | NaN | NaN | NaN | 4707476 | Sedan | NaN | NaN | NaN | NaN |
| 2075421 | 02/06/2024 | 12:37 | BROOKLYN | 11235 | 40.586670 | -73.966156 | (40.58667, -73.966156) | OCEAN PARKWAY | AVENUE Z | NaN | 1.0 | 0.0 | 1 | 0 | 0 | 0 | 0 | 0 | Unspecified | NaN | NaN | NaN | NaN | 4707884 | E-Bike | NaN | NaN | NaN | NaN |
| 2075422 | 03/05/2024 | 17:22 | QUEENS | 11436 | 40.680477 | -73.792100 | (40.680477, -73.7921) | SUTPHIN BOULEVARD | FOCH BOULEVARD | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Failure to Yield Right-of-Way | Unspecified | NaN | NaN | NaN | 4707511 | Station Wagon/Sport Utility Vehicle | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| 2075423 | 03/05/2024 | 17:00 | BROOKLYN | 11204 | 40.610786 | -73.978820 | (40.610786, -73.97882) | NaN | NaN | 161 AVENUE O | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Driver Inexperience | Unspecified | Unspecified | Unspecified | NaN | 4707419 | Ambulance | PK | Van | PK | NaN |
| 2075424 | 03/03/2024 | 17:50 | NaN | NaN | 40.675053 | -73.947235 | (40.675053, -73.947235) | SAINT MARKS AVENUE | NaN | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Aggressive Driving/Road Rage | Unspecified | NaN | NaN | NaN | 4707855 | Station Wagon/Sport Utility Vehicle | PK | NaN | NaN | NaN |
| 2075425 | 03/05/2024 | 14:30 | BROOKLYN | 11207 | 40.677900 | -73.892586 | (40.6779, -73.892586) | MILLER AVENUE | FULTON STREET | NaN | 1.0 | 0.0 | 1 | 0 | 0 | 0 | 0 | 0 | Pedestrian/Bicyclist/Other Pedestrian Error/Confusion | NaN | NaN | NaN | NaN | 4707872 | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN | NaN |
| 2075426 | 03/05/2024 | 8:00 | QUEENS | 11385 | 40.706512 | -73.878136 | (40.706512, -73.878136) | EDSALL AVENUE | 73 STREET | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Failure to Yield Right-of-Way | Unspecified | NaN | NaN | NaN | 4707447 | Sedan | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |